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Abstract 

We present a general methodology to incorporate fundamental economic factors 
to our previous theory of herding to describe bubbles and antibubbles. We start 
from the strong form of Rational Expectation and derive the general method to 
incorporate factors in addition to the log-periodic power law (LPPL) signature of 
herding developed in ours and others' works. These factors include interest rate, 
interest spread, historical volatility, implied volatility and exchange rates. Standard 
statistical AIC and Wilks tests allow us to compare the explanatory power of the 
different proposed factor models. We find that the historical volatility played the 
key role before August of 2002. Around October 2002, the interest rate dominated. 
In the first six months of 2003, the foreign exchange rate became the key factor. 
Since the end of 2003, all factors have played an increasingly large role. However, 
the most surprising result is that the best model is the second-order LPPL without 
any factor. We thus present a scenario for the future evolution of the US stock 
market based on the extrapolation of the fit of the second-order LPPL formula, 
which suggests that herding is still the dominating force and that the unraveling 
of the US stock market antibubble since 2000 is still qualitatively similar to (but 
quantitatively different from) the Japanese Nikkei case after 1990. 
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1 Introduction 



As an extension of [25], since October 2002, we have posted an analysis of the 
US (and later other) markets after the collapse of the "information technology 
bubble," forecasting a continuation of the downward correction, decorated 
with ups and downs organized according to a pattern we call "log-periodicity." 
Monthly updates 2 are available from October 31, 2002 till November 17, 2004. 

Our projections have been based on the generalization to "bearish" markets 
(we call them "antibubbles" when they follow market bubble peaks) of the 
concept of positive feedback based on imitation and herding that our team 
has demonstrated mostly for bubbles preceding financial crashes (see [22] for 
a review and references therein). 

In December 2004, we decided to discontinue the update, concluding that, 
after more than two years, our projections for the US market have not been 
verified, while they have been good from the vantage of foreigners as well 
as for other foreign developed markets. In December 2004, we attributed our 
failure to forecast the rebound of the market in 2003 to the fact that we had 
neglected all factors except the imitation and herding behavior of investors. In 
particular, our approach had neglected the fundamental input of the economy 
as well as the external manipulations by the Fed and other institutions, the 
impact of foreign policy and foreign investors, the dollar effect (all these being 
possibly inter-related). Our projections had taken the extreme view that all 
these actions are endogenously determined and driven by the collective action 
of the investors. We thought this is too simplistic because, in general, one can 
understand the evolution of system as the result of an entangled combination of 
endogenous organization but also as a response to external news and exogenous 
shocks (see [23] for a recent quantitative discussion on the issue of endogeneity 
versus exogeneity). In December 2004, we concluded that our live experiment 
from Oct. 2002 until Nov. 2004 had clearly demonstrated the limits of this 
approximation. 

We announced at the web site that we had found a methodology to incorporate 
the Feds action and the dollar effect in an extended version of our theory, to 
combine our herding theory with all these other factors. The first purpose of 
the present paper is to present this methodology. We start from the strong 
form of Rational Expectation and derive the general formalism to incorporate 
factors in addition to the log-periodic power law (LPPL) signature of herding. 
Then, our first tests show that rational expectation should be partially relaxed 
and we present several models combining LPPL with different factors (interest 
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rate, interest spread, historical volatility, implied volatility, exchange rates). 
One particularly interesting theoretical result is that this formalism allows 
us to justify that the exchange rate factor is equivalent for a certain value 
of parameters to take the view of foreigners as empirically discovered in our 
monthly updates. We also present several standard statistical AIC and Wilks 
tests to compare the different proposed factor models. 

The most surprising result is in fine that we have to prefer an extension of the 
LPPL formula, the so-called second-order LPPL first introduced in [24], to all 
other possible models including economic factors. We thus present a scenario 
for the future evolution of the US stock market based on the extrapolation of 
the fit of the second-order LPPL formula. 



2 Model 1: Rational Expectation bubble model with a single volatil- 
ity factor and log-periodicity 

2.1 The stochastic discount factor 

The theory of stochastic pricing kernel [4] provides a unified framework for 
pricing consistently all assets under the premise that the price of an asset is 
equal to the expected discounted payoff, while capturing the macro-economic 
risks underlying each security's value. This theory amounts to postulating 
the existence of a stochastic discount factor (SDF) M that allows one to 
price all assets consistently. The SDF is also termed the pricing kernel, the 
pricing operator, or the state price density and can be thought of as the 
nominal, inter-temporal, marginal rate of substitution for consumption of a 
representative agent in an exchange economy. Under an adequate definition of 
the space of admissible trading strategies, the no-arbitrage condition translates 
into the condition that the product of the SDF with the value process p(t), 
of any admissible self-financing trading strategy implemented by trading on a 
financial asset, must be a martingale: 



where t' refers to a future date. The no-arbitrage condition (1) expresses that 
pM is a martingale for any admissible price p. Technically, this amounts simply 
to imposing that the drift of pM is zero. 

Let us assume as a first example the following simple dynamics for the SDF, 
which will lead to Model 1 for the price dynamics: 



M(t)p(t) = E t W)M(t>)} 



(1) 



dM(t) 
M(t) 



r(t) dt - (f>(t) dW(t) + g(t)dW . 



(2) 
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The drift —r(t) of M is justified by the well-known martingale condition on 
the product of the bank account and the SDF [4]. Intuitively, it retrieves the 
usual simple form of exponential discount with time. The process </> denotes 
the market price of risk, as measured by the covariance of asset returns with 
the SDF. Stated differently, <p is the excess return over the spot interest rate 
that assets must earn per unit of covariance with W, hence the negative sign 
in front of <fr(t) dW(t) in (2) which leads to a positive contribution to the 
return \i as we now show. The last term g(t)dW embodies all other stochastic 
factors acting on the SDF, which are orthogonal to (uncorrelated with) the 
stochastic process dW. 

For the sake of pedagogy and clarity, let us provide a straightforward derivation 
of Ito's calculus applied to the condition that the process pM is a martingale. 
For this, we form the expectation of 

pit + dt)M(t + dt) - p(t)M(t) 

p(t)M(t) { } 

whose drift (the term proportional to dt) must be zero for the no-arbitrage 
condition to hold. This reads 



E 



p(t + dt)M(t + dt) - p(t)M(t) 
, p(t)M(t) 



E 



E 



E 



(p(t) + dp) (M(t) + dM) - p{t)M(t) ' 

p(t)M(t) 
p(t)dM + M(t)dp + dMdp~ 



p(t)M(t) 
dM dp dM dp 
M + p + M p 



(4) 



2.2 Price dynamics with rational expectation (RE) and the stochastic dis- 
count factor 



The price dynamics is written in the standard form 

— = fi(t)dt + a(t)dW - Kdj , (5) 
p 

where dW is the increment of a random walk with no drift and unit variance 
and dj is the jump process equal to in absence of a crash and equal to 1 
when a crash occurs. The crash hazard rate h(t) is given by 

E t [dj] = h(t)dt (6) 
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by definition, where E t [x] denotes the expectation of x conditioned on all 
information available at time t. 

To obtain the RE price dynamics, we determine fi(t) such that the the process 
pM be a martingale, that is, the factor proportional to dt of the last term of 
(4) 

fi(t) - r(t) - nh(t) - a(t)(f)(t) , (7) 

be zero. This leads to 

/i{t) - r(t) = Kh{t) + a{t)<p{t) , (8) 

where we have used that E t [dW] = = E t [dW ■ dW], E t [dW 2 ] = dt and 
(6). The expected price conditioned on no crash/rally occurring (dj = 0) is 
obtained by integrating (5) with (8): 

EtX*)] =K*o) L(t) exp ^£dr/i(r)) , (9) 

where 

L(t) = exp Qf dr [r(r) + <r(r)0(r)]) . (10) 
For r(t) = <fr(t) = 0, L(t) = 1, we recover the previous formulation [11,15,10]. 

Expression (9) with (10) represents the dynamics of the asset price p(t) as the 
results of three contributions: 

• the interest rate r(t), which provides as usual the reference background 
against which to compare stock market price returns, 

• the price volatility a(t) priced in proportion of the market price of risk 
<f)(t) of the SDF, the later embodying the positive relation between risk and 
return suggested by most asset pricing models [17], 

• the crash hazard rate, through another risk-return positive relationship 
emerging from the negative price drop associated with a crash. 



2.3 Fit with the LPPL model for the crash hazard rate h{t) 

Knowing the empirical data of the interest rate r(t) and the volatility, we 
use equation (9) with (10) augmented with a model of the crash hazard rate 
(and assuming a simple dynamics for <f)(t)) to fit the empirical price time series 
p(t). In this section, we use the simplest specification that <fi(t) = <fi is constant. 
Here, we model the crash hazard rate as explained in [15,10] (for a general 
presentation and specific derivation of the crash hazard rate from herding, 
see [22]). In a nutshell, the crash hazard rate is modeled by a log-periodic 
power law (LPPL) which captures the intermittent positive feedback during 



5 



imitation regimes of bubbles before crashes and antibubbles developing after 
a bubble. This leads to modeling the integral of h(t) in the exponential term 
with the following LPPL formula 

re [ drh(r) = B\t c - t\ rn + C\t c - t\ m cos[ujln \t c - t\ -if>] . (11) 

Putting a(t) and r(t) to zero retrieves previous works consisting in fitting the 
logarithm of the price by the LPPL formula. 

Let us rewrite the formula (9) and (10) in the following form 

y(t) = ln\p(t)} - f r(r)dr = hx\p(t )] + (f> f a(r)dr + re f drh(r) (12) 

Jto Jto Jto 

where we replace E to [p(t)} by the realized price pit). Substitution of Eq. (11) 
into (12) results in 



where 



y(t) = A + cj>v(t) + Bf(t) + Cg(t) 



y(t) = ln[p(t)] - JJ r(r)dr , 

A = ln[p(t )] , 
v(t) = Jla(r)dr , 
f(t) = \t c -t\ m , 

git) = \t c — t\ m cos[u;ln \t c — t\ — ip] 



(13) 



(14) 



We use the following procedure. 



(1) We take pit) to represent the S&P500 index price. 

(2) We specify r(t) as being the yield of the three-month Treasury bill. 

(3) We take the CBOE Volatility Index (VIX) as the proxy for the volatility 
factor a(t), which is one of the world's most popular measures of investors' 
expectations about future stock market volatility (that is, risk) 3 . 

(4) to is the beginning time of the interval over which the reduced price time 
series y(t) is fitted. It is chosen a priori as explained below. 

(5) We perform an OLS fit of y(t) with the free parameters A, B, C, (ft, t c , m, u, ip. 
Note that A,B,C are linear parameters which can be slaved to the others. 

is not a linear parameter because it must remain positive or zero 4 



3 See htt p:/ /www.cboe.com/ for more information on the VIX. 

4 It is however possible to slave <f> as a linear parameter and then check at the end 
of the OLS fit if it is positive. If yes, we accept the fit, if not, we put = and 
then redo the fit without this parameter. 
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We use the trapezoid scheme for integration of // r(r)dr and // cr(r)dr: 



Using Eqs. (13,14,15), we have fitted the USA S&P 500 price in its antibub- 
ble regime since year 2000 (see [25,26,28,30] for previous related works on 
the behavior of the market after the crash of March-April 2000) and the re- 
sult is shown in Fig. 1. The r.m.s. of the fit residuals is x = 0.20, which is 
very large compared with the pure LPPL fit, showing that Model 1 is a bad 
representation of the price. 

In addition to the large value of the r.m.s. of the fit residuals, there is another 
diagnostic for the bad quality of Model 1: the three different contributions 
(LPPL, VIX and interest rate r(t)) are large compared with their sum, sug- 
gesting the existence of spurious (in the sense of "over-fitting") compensations 
between these terms. In contrast, since the LPPL term was shown to fit very 
well the data over the period from 2000 to 2003 [25,26,28,30], the LPPL term 
should be the leading contribution while the other factors should have been 
perturbations, perhaps growing with time. In other words, Model 1 is not a 
perturbation of the LPPL model used previously in [25,26,28,30]. And the 
reason for this is that the interest rate contribution f* o r(r)dr is given and its 
amplitude cannot be modulated within the RE formulation. Since the risk-free 
interest rate has been always positive, its integral grows with time, given a 
contribution with a trend opposite to the overall negative trend of the price 
over the period 2000 to 2003; hence, the need for the LPPL contribution to 
counter-balance this effect and the ensuing bad fit and probably strong over- 
fitting. 



3 Model 2: Weaker semi-martingale condition on prices 

Generalizing Model 1 to include other factors does not improve, because the 
positive contribution of the risk-free interest rate (which is an essential com- 
ponent in the RE formulation of the pricing model) requires the other terms 
and in particular the LPPL component to adjust to large negative values to 
compensate its large positive contribution. The risk-free interest rate turns 
out to have an overwhelming and counter trending role in the bearish phase 
of the market with declining prices. In the context of the RE formalism, a de- 
clining price would then require very strong antagonistic forces to compensate, 
leading to unrealistic impacts of other factors and of the LPPL contribution. 
Adding other factors do not provide realistic explanations of the price tra- 
jectory over this period. We do not show the fits and corresponding figures 




[r(r- 1) +r(r)]/2 



(15) 
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obtained with all the factors we have tested, as they teach a lesson similar 
to that of Fig. 1. We conclude that it is essentially impossible to obtain a 
good model of the empirical price trajectory using the no-arbitrage rational 
expectation bubble model enriched with factors such as interest rate spread, 
volatility and exchange rates. 

We are thus forced to remove the no-arbitrage condition and replace it by a 
weaker condition. This amounts to assuming that the market is incomplete. 
Indeed, we find that the technical source of the problems in fitting the em- 
pirical data with our model is the very large cumulative contribution of the 
risk-free interest rate r(t). Recall that this contribution comes from the no- 
arbitrage condition obtained under a change of probability measure from the 
objective to a 'risk-neutral' probability measure (a priori distinct from the 
objective one), which changes the price process from a semi-martingale into a 
martingale [8,9]. We thus propose to change the no-arbitrage condition, which 
is equivalent to requiring that the prices are martingales, by the condition 
that the prices follow a semi-martingale, with the drift of the semi-martingale 
taken proportional to the risk-free rate. In the setting of incomplete markets, 
the fair price is not attainable as the particular expectation over a unique risk- 
neutral measure, but rather as a supremum over an infinite set of equivalent 
martingale measures [16]. Our specification for the drift of our semi-martingale 
price amounts to reducing the set of equivalent measures as in a variational 
set-up. We allow this drift to adjust its amplitude to reflect the existence of 
price deviations which have not been arbitraged away, possibly due to tech- 
nical constraints such as transaction costs and short-sale constraints, but also 
because of the reluctance of arbitragers to take risks against the crowd senti- 
ments. In other words, the abandon of the no-arbitrage condition reflects the 
non-fully rational behavior of investors. 

The transition to incomplete markets amounts to change (10) into 



where a is a new adjustable parameter. Expression (9) with (16) and (11) 
define our Model 2, whose fit to the price time series of the the S&P500 
already shown in Fig. 1 is presented in Fig. 2. The r.m.s. of the fit residuals is 
X = 0.0482. The improvement of model 2 compared with model 1 is twofold 
and very significant. First, model 2 better captures the evolution of the S&P 
500 index with a much smaller value of the r.m.s. of the fit residuals. Second, 
the relative amplitudes of the impact of interest and VIX are much smaller 
than in model 1. However, we see that the impact of VIX is still very large 
compared with the LPPL factor. This calls for further modification of the 
model. 




(16) 
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4 Addition of other factors 



4-1 Description of other potentially relevant factors 



The introduction of the price volatility a(t) in Models 1 and 2 of the previ- 
ous section was justified by the assumed positive relation between risk and 
return suggested by most asset pricing models [17]. There is a large empirical 
literature that has tried to establish the existence of such a tradeoff between 
risk and return for stock market indices. As summarized in [6], the results 
have been inconclusive and the relation between risk and return is often found 
insignificant, and sometimes even negative. However, with better monthly vari- 
ance estimates based on past daily squared returns, the ICAPM intertemporal 
relation between the conditional mean and the conditional variance of the ag- 
gregate stock market return has recently been found positive [5] . One possible 
reason for the difficulties in measuring a positive risk-return relation is that 
other state variables in addition to the volatility may influence the investment 
opportunities anticipated by investors [2,21]. In particular, variables related 
to the business cycle have been found to predict returns [3]. They include 
the dividend-price ratio, the relative Treasury bill rate, the default spread 
(difference between the yield on BAA and AAA-rated corporate bonds, ob- 
tained from the FRED database), the Fama- French factors Rm, SMB, and 
HML 5 , the exchange rates and so on. The excess return Rm on the market is 
calculated as the value-weighted return on all NYSE, AMEX, and NASDAQ 
stocks (from CRSP) minus the one-month Treasury bill rate (from Ibbotson 
Associates). SMB (Small Minus Big) is the average return on three portfolios 
made of assets with small capitalizations minus the average return on three 
portfolios made of assets with big capitalizations. HML (High Minus Low) is 
the average return on two value portfolios minus the average return on two 
growth portfolios. In addition, determinants of investors' risk aversion identi- 
fied in the asset pricing literature are economic growth prospects, measures of 
equity and credit market risk, fluctuations in the exchange rate and negative 
news events in other equity markets. If any of these factors are anticipated to 
forecast or influence future returns, they should be incorporated in the return 
equation. We thus discuss extensions of Model 2 which incorporate different 
additional factors as now explained. 



4-2 Model 3: Adding the exchange rate factor 



The fact that exchange rates (for instance euro/$, i.e., the value of one euro 
in dollars) is a relevant factor in the price equation is suggested empirically 



5 See the website at http : //mba . tuck . dartmouth . edu/pages/f acuity/ken . f rench/ data_library . html 
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by a live experiment to forecast the S&P500 index that we have performed 
from Oct. 2002 till Nov. 2004. This experiment led to the suggestion that the 
S&P500 index was more predictable when expressed in euros or other major 
foreign currencies than in US dollars. The rational for this observation is mul- 
tifold. First the conversion of the dollar value of the US stock market into euros 
reflects more the view point of foreign investors, which constitute a growing 
fraction of the population of investors, especially at times of speculative bub- 
bles and more generally herding [27], but also due to the growing influence of 
debts at times of bulging US deficits. Second, valuing in euro provides a sim- 
ple mean of eliminating (at least in part) the dollar effect resulting from the 
policy of the Fed (and in particular its liquidity inputs). Third, the relevance 
of the exchange rate as an important factor to forecast returns may reflect 
the expectation that a lower dollar means a more competitive US economy, 
with its many spillovers on market shares, job creation, earnings, investments 
and productivity gains and so on. Inversely, a weaker dollar may actually be 
worrying because it favors the resumption of inflation, higher interest rates 
which may slow down the economy, potential loss of confidence of foreign in- 
vestors on the sustainability of the US deficits, varying profits for companies 
trading internationally and so on. The exchange rate is also responding to 
global trade deficit of the US versus the rest of the world. It is often advanced 
that the progressive collapse of the US dollar since 2002 is in large part driven 
by the growing US deficits (Federal debt, private debt, and current account) 
[18]. These effects are in general going in opposite directions but with different 
time scales. 



Let us thus write 

P$(t) = Pcum (t)e R ^ {t) , (17) 

which defines the exchange rate return between the dollar and the euro. 
Putting for instance p eur o = 1 gives the number of dollars equal to e Rcxch 
we obtain from one euro at time t. Now, the risk perceived by the economic 
agents is not determined by e Rc * ch( -^ or R cxc h(t) but by the relative temporal 
variations of p$(t) / p euTO (t) . We have i? exc h(^) = hip$(t) — In p curo (t) and thus 

m_ , c /,w,, dp$(t)/dt dp curo (t)/dt 
r ex ch{t) = dR exch (t)/dt = — — — . (18) 

P${t) Peuro(^) 

^exch(^) defined in (18) quantifies the real risks due to changes in exchange 
rates. Indeed, an investor long in dollars (say, he owns one unit of dollars) has 
the equivalent wealth of e~ Rc * ch( -^ euros at time t, which becomes e _iJcxch ^ +d ^ 
euros at time t + dt. He makes a relative profit or loss (return) in euro equal 
to 

= -r-*(t)dt . (19) 

Of course, the situation is symmetric for an investor having one euro and mea- 
suring his wealth variation in dollars, leading to a profit or loss of +r cxc h(t)<it 
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over one elementary time step dt. 



We thus generalize the market price of risks 

<t>(t)dW(t) -> <j)dW + Vc XC hr cxch (t)dW cxch , (20) 

where r/exch sets the scale of the impact of the risks associated with the ex- 
change rate fluctuations. Note that ?7 cxc h can be either positive or negative. 
Correlatively, we generalize the price dynamics (5) to make explicit its depen- 
dence on the factors identified in the market price of risks: 

= fi(t)dt + a™ ch dW oxch + a(t)dW - ndj . (21) 

The total volatility (variance) of the price returns [erf* 011 ] 2 + [#"(£)] 2 is the sum 
of two contributions, one resulting from the fluctuations associated with the 
exchange rate and the other which captures all other factors of uncertainty. 
Then, the semi-martingale condition leads to change (8) into 

fi(t) = ar(t) + Kh(t) + ja(t) + a cxch r cxch (t) , (22) 

where a exc h = ^ e xchO"p Xch - The expected price conditioned on no crash/rally 
occurring (dj = 0) is of the form (9) with 

L(t) = exp Qf dr [ar{r) + 7a + awchfcxch]) • (23) 

Note that 7 > 0, while a cxc h can be of any sign, depending whether the risk 
factor is r exc h or its negative — r exc h- The choice of one or the other amounts to 
analyzing the exchange rate from dollars to euros or vice- versa. This symmetry 
allows for an a priori arbitrary sign for r/ cxc h- 



According to (17) and (18), 

P${t) Pcuro(^o) 



/ dr r cxch (t) = In 

Jtn 



_Pcuro(t) P$ (t ) 

Then, for the special choice a cxc h = ?7exchC"p Xch = 1, we have 



(24) 



^Qoxch Jt Q dr r cxch("r) _ P$(t) Peuro(tp) 
Peuro(t) P$(to) 



In this case, expression (9) with p(t) understood as p$(t) is replaced by 

ft 

1 to 



Et [p euro (t)] =Pcuro(t ) L{t) exp (k£ dTh(r)j , (26) 
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where 



L{t) = exp ( / dr [ar{r) + 7c (r)] 



to 



(27) 



In words, equation (26) means that the term exp N t * dr [o: exc hrexch]) in (23) 
embodying the exchange risk factor is absorbed in the transformation from 
dollars to euros. Model 3 thus provides a justification for expressing the US 
stock market from the view point of the euro as we have done in our monthly 
online predictions (our monthly predictions used L{t) = 1). This procedure 
is nothing but the particular case (a exc h = ^cxch^ 011 = 1) of Model 3 which 
incorporates the risk factor associated with the exchange rates between the 
US dollar and other currencies. By generalizing the description of the US stock 
market viewed not solely from the vantage of foreigners, Model 3 allows for 
a market price of risk associated with the exchange rate which is not entirely 
described in terms of the euro view point. 



We implement Model 3 numerically as: 

y{t) = A + ar(t) + >yv(t) + a cxch R{t) + Bf(t) + Cg(t) 

where 



{21 



yit) 


= ln[p(t)] , 


A 


= ln[p(to)] , 


r{t) 


= Jt r 3m (r)dr , 


v{t) 


= & ^(r)dr , 


R{t) 


= ft o r exch (r)dr , 


/(*) 


= {t c ~ t) m , 


k 9® 


= {t c - t) rn cos[uj\n{t c - t) 



(29) 



Figure 3 shows the evolution of the S&P 500 index from Aug-20-2000 to Mar- 
21-2005 fitted with Model 3. The r.m.s. of the fit residuals is 0.0482. We notice 
that this figure is undistinguishable from Fig. 2. Both model 2 and model 3 
gives the same r.m.s. of fit residuals and the fitted parameters are quite close 
to each other. In the inset, we see that the impact of the foreign exchange rate 
can be neglected. Notwithstanding the decrease of the interest rate during the 
antibubble period, which is causally slaved by the market [29], the short term 
interest rate has increasing negative impact on the stock market. 

From Fig. 3, the impact of interest rate is still very large, we thus fit a modified 
model 3 containing LPPL, VIX, FX but without the interest rate r{t) (forcing 
a = 0). The result is shown in Fig. 4. This new figure emphasizes just the 
effect of the FX, without the interfering effect of the interest rate r{t) and 
again the impact of VIX can be neglected. 
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4-3 Model 4-' Semi-Martingale bubble model with additional factors 



It is natural to extend Model 2 defined by (9) with (23), by adding other 
known factors of risks. We explore the relevance of the interest rate spread 
and of the historical volatility. This amounts to change (23) into 



L = exp ( / dr [or + ja + a cxch r cxch + a h istO"hist + "spread (ri 0y - r 3m )] (30) 



where a^t and a sp read are two new adjustable parameters. Note again that 7 
and Q!hist are taken positive. r 10y — r 3m is the difference in yields between the 
Treasury bill with 10 year and 3 month maturities. 

Proceeding as before for the OLS fit of this Model 4 to the S&P500 price 
trajectory, we obtain Fig. 5. The r.m.s. of the fit residual is 0.0409, which is 
smaller than the value 0.0436 of the second-order Landau fit to the same data 
set (which was up to now the best, i.e., smallest r.m.s) and of course much 
smaller than the r.m.s. 0.0601 of fit residuals using the first-order LPPL model. 
The inset shows the amplitudes of the contributions of the different factors 
to the global fit. It shows that, in addition to the LPPL factor previously 
considered in [25,26,28,30], the short-term interest rate and the interest rate 
spread are the dominant factors, while the other three factors are weaker. The 
contribution of a (or VIX) and the historical volatility are actually completely 
negligible since the values of 7 and «hi S t is very small and close to zero. 

Notwithstanding the overall quality of this fit shown in Fig. 5, we argue that it 
should be disqualified for the same reasons as discussed in section 2.3 concern- 
ing Fig. 1: the spread and interest rate contributions are quite large, which 
forces the LPPL contribution to increase with time. This LPPL contribution 
thus exhibits a trend opposite to the initial simple LPPL fit and, therefore, 
Model 4 cannot be considered as a perturbation to the simple LPPL model. 

4-4 Second- order Landau formula 

An extension of the LPPL formulas (11) and (12) described in section 2.3 
has been proposed in [24], based on a formulation of the critical power law 
behavior in terms of a Landau-like expansion 



where the coefficients are generally complex and r = \t — t c . Keeping only the 
first term aF(r) of the right-hand-side of (31) with a = m + iuo retrieves (13) 
without the term 4>v(t), which is the so-called first-order LPPL formula [22]. 




dF(r) 



aF{r)+(3\F{xr)\ 2 F{T)... , 



(31) 



dlnr 
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Including the second-order term j3\F \t)\ 2 F '(r) leads to [24] 



I(t) = A + 



Br m + Cr m cos - 


a;lnr+^ln 


. 1 + te) 


+ <#>} 


I 





(32) 



which was used to model the bubbles before the USA 1929 crash and 1987 
crash [24]. A third order LPPL formula was used to model the nine- year Nikkei 
antibubble since January 1990 and predict the ensuing two-year evolution 
[13,14]. 



Equation (32) predicts the transition from the angular log-frequency uj + A w 
for A t < \t — t c \ to the angular log-frequency uj close to t c . This corresponds 
to an approximate description of a log-frequency modulation. For instance, 
the 1990 Nikkei antibubble studied in [13,14] experienced the transition from 
the first-order Landau description (11) and (12) to the second-order Landau 
formula (32) approximately 2.5 years after the inception of the antibubble 
[13,14]. In contrast, the 2000 S&P 500 antibubble was found to just barely 
enter the second-order Landau regime on the fourth quarter of 2003 after more 
three years since its inception [30], while the second-order Landau regime was 
undetectable from data ending in the last quarter of 2002 [26] . 



In the following tests, we compare between them the different models con- 
structed with different factors and with the second-order LPPL formula (32). 



5 Tests on the relevance of the different factors 



Let us consider the family of one-factor LPPL models defined by 

L = exp Qf aifidr^j , with i = 1,2, 3, 4, and 5, (33) 

where atifx = ar, a 2 f 2 = a 3 f 3 = a cxch r cxch , a 4 f 4 = tthistOhist, and a 5 f 5 = 
ttspread( r io y ~ r 3m)- We fit the S&P 500 antibubble after the burst of the "new 
economy" bubble in April of 2000 using each of the five models defined by 
Eq.(33). The time series is fitted from Aug-20-2000 to a time ii as t which is 
varied from 08-Apr-2002 to 10-Mar-2005 with a step of 21 trading days. Each 
interval from Aug-20-2000 to a time ti ast is fitted by each of the five models. 
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5.1 Akaike's AIC criterion 



To compare these five models with the first-order LPPL and second-order 
LPPL, we use Akaike's minimum AIC estimation [1], which was designed for 
the identification of several competing models. The AIC is defined by 

AIC = -2 ln(L) + 2k , (34) 

where L is the maximum likelihood of a given model given the data and k is the 
number of independently adjusted parameters within a model. By assuming a 
Gaussian distribution of the fit residuals, we have 

Tl 

InL = -nln(ff) - -[ln(27r) + 1] , (35) 

where n is the number of data points and a 2 = \ 2 is the maximum likelihood 
estimate of the variance of the fit residual. 



The top panel of Fig. 7 shows the evolution of the AIC's of the five one-factor 
LPPL models in comparison with the first- and second-order Landau LPPL 
models. To better illustrate the competition among the models, we plot the 
relative AIC with respect to the first-order LPPL model in the lower panel of 
Fig. 7. This figure suggests that the historical volatility played the key role 
before August of 2002. Around October 2002, the interest rate dominated. In 
the first six months of 2003, the foreign exchange rate became the key factor. 
Since the end of 2003, all factors have played an increasingly large role. Note 
also that after July 2003, the second-order Landau LPPL model has the largest 
AIC and is identified as the best model, suggesting as discussed elsewhere 
[25,30] a progressive maturation of the herding effect in this "antibubble," 
similarly to the case of Japan [12,14]. 

We should stress however that this qualitative similarity between the US an- 
tibubble since 2000 and the Japanese antibubble since 1990 does not extend 
to the quantitative level. In [26] written in 2003, we stated that we could 
not find neither quantitative nor qualitative differences between the US and 
the JP antibubbles. Things have changed in the last two years. Our present 
analysis shows that the second-order LPPL regimes of the two antibubbles 
are quite different; for instance, the important parameter Au have different 
signs and significantly different values; the A t 's are also very different. We 
conjecture that these quantitative differences may signal deep divergences in 
the mechanisms and evolutions of the two antibubbles in the present regime 
described by the second-order LPPL model and beyond. 
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5.2 Wilks' test of embedded hypotheses 



Since the single factor models defined with expression (33) contain Model 
1 defined by (13) as a special case a, = 0, we can use Wilks theorem [20] 
and the statistical methodology of nested hypotheses to assess whether the 
hypothesis that = can be rejected. By assuming a Gaussian distribution 
of observation errors (residuals) at each data point, the maximum likelihood 
estimation of the parameters amounts exactly to the minimization of the sum 
of the square over all data points (of number n) of the differences Sj (i) between 
the mathematical formula and the data [19]. According to Wilks theorem of 
nested hypotheses, the log-likelihood-ratio 

T = -2(L - Li) = 2n(ln o x - In a Q ) , (36) 

is a chi-square variable with k degrees of freedom, where k is the number of 
restricted parameters [7]. In the present case, we have k = 1. The Wilks test 
thus amounts to calculate the probability that the obtained value of T can 
be over-passed by chance alone. If this probability is small, this means that 
chance is not a convincing explanation for the large value of T which becomes 
meaningful. This implies a rejection of the hypothesis that «j = is sufficient 
to explain the data and favor the fit with «j ^ as statistically significant. 

Figure 8 plots the evolution of the probability Pr that the log-likelihood- 
ratio exceeds T at ti ast . The second-order Landau formula is also tested for 
comparison. We see that there are always factors that are significant at a level 
of significance far better than 99%, that is, Pr < 1%. For instance, 04/0 
(historical volatility) passed the test in the time period before August of 2002. 
Since 2004, all five one-factor models and the second-order Landau model are 
significant. 

It is noteworthy that the second-order Landau formula experienced a tran- 
sition from large Pr to small Pr in the first quarter of 2003, signaling the 
crossover from the first-order Landau regime to the second-order regime, which 
confirms our earlier preliminary assessment [30]. In contrast, the one-factor 
models exhibit large values of Pr for several months in 2003. 

Together with the fact that the r.m.s. of the residues of the fit is the smallest 
for the second-order LPPL formula (32), we conclude that the second-order 
LPPL model is the best model. The economic factors investigated here seem 
less powerful than just herding effects to account for the continuation of the 
unraveling of the US stock market antibubble. 
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6 Prediction using the second-order LPPL formula (32) 

So far, we have shown that the macroscopic factor models are not as good as 
the second-order formula. For the US S&P 500 antibubble started in August 
2000, the second-order formula (32) provides by far the best model so that 
the other factors cannot explain what is going on in the US stock market. Our 
analysis also shows that the antibubble has entered the second-order Landau 
regime approximately during the summer of 2003, confirming our preliminary 
analysis [29]. In addition, the tests in [29] showed that, in the framework of the 
first-order Landau model, the antibubble was probably still alive in August 
2003 but has ended since in the USA (i.e., when viewed from the view point of 
a US investor valuing in US dollars). The timing of the end of the first-order 
LPPL antibubble is roughly consistent with our current analysis. 

To further show that the crossover happened in 2003, we show in Fig. 9 the 
evolution of the fitted values of A t and A w which are the diagnostic of the 
relevance of the second-order formula. The dramatic drop of the value of At 
endorses the crossover from the first-order regime to the second-order. 

These tests suggest that it is possible to resume the modeling and prediction of 
the antibubble within the second-order Landau model. Fig. 10 shows the fit of 
the US S&P 500 index from 2000/08/21 to 2005/03/21 using the second-order 
Landau formula. The fitted inception date of the antibubble t c = 2000/08/17 is 
found stable in all our fits with different terminal values of the fitted windows. 
The extrapolation of the fitted second-order formula shown in Fig. 10 suggests 
that the market is not yet ready for a solid recovery. 
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Fig. 1. Comparison of the S&P500 index price with the best fit obtained with 
Eq. (12), Eq. (13) and Eq. (15). The fitted parameter values are the following: 
t c = 2000/07/09, m = 0.44, uj = 7.83, V = 2.78, A = 18.70, (f> = 5.16 x 10~ 4 , 
B = —1.93, and C = —0.03. The inset shows the impact of different factors. 
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Fig. 2. Comparison of the S&P500 index price with the best fit of model 2 defined 
by expression (9) with (16) and (11). The fitted parameter values are the following: 
t c = 2000/09/30; m = 1.22, u = 4.68, ip = 3.62, = 0, A = 7.32, a = -1.58 x 10" 2 , 
B = 4.58 x 10~ 6 , and C = 3.51 x 10~ 5 . The r.m.s. of the fit residuals is 0.0482. 
The inset shows the impact of different factors. The the LPPL factor is reduced or 
translated by the amount of A for a better comparison. 
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Fig. 3. Comparison of the S&P500 index price with the best fit of model 3 defined 
by expression (9) with (16) and (11). The fitted parameter values are the following: 
t c = 2000/10/17; m = 1.25, w = 4.52, ijj = 4.83, 7 = 0, A = 7.31, a = -1.55 x 10" 2 , 
a exc h = 2.02 x 10~ 2 , B = 2.68 x 10~ 6 , and C = 2.98 x 10~ 5 . The r.m.s. of the 
fit residuals is 0.0482. The inset shows the impact of different factors. The LPPL 
factor is reduced or translated by the amount of A for a better comparison. 
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Fig. 4. Comparison of the S&P500 index price with the best fit of the modified 
model 3. The fitted parameter values are the following: t c = 2000/09/25; m = 0.65, 
oj = 4.60, ip = 3.99, j = 0, A = 7.29, a exch = -0.502, B = -5.55 x 10" 3 , and 
C = 1.93 x 10~ 3 . The r.m.s. of the fit residuals is 0.0553. The inset shows the impact 
of different factors. The LPPL factor is reduced or translated by the amount of A 
for a better comparison. 
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Fig. 5. Evolution of the S&P 500 index from Aug-20-2000 to Nov-20-2003 with 
the fit to Model "weak". The values of the fit parameters are the following: 
t c = 2000/09/09; m = 1.30, u = 7.80, r/j = 0.82, 7 = 0, a hist = 0, A = 7.31, 
a = -3.89 x 10~ 2 , a spread = -7.64 x 10" 2 , a cxch = 0.345, B = 2.83 x 10" 4 , 
C = 9.88 x 10~ 6 . The r.m.s. of the fit residuals is 0.0409. The inset shows the im- 
pact of different factors contributing to the overall fit of Model 4. The LPPL factor 
is reduced or translated by the amount of A for a better comparison. 
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Fig. 6. Fit to the modified model of all factors except the interest rate. The values of 
the fit parameters are the following: t c = 2000/07/16; m = 0.80, oj = 5.44, iji = 4.17, 
7 = 0, ahist = 9.11 x 10~ 7 , A = 7.39, a sprcad = 1.54 x 10~ 2 , a cxch = -0.182, 
B = -3.23 x 10" 3 , and C = 5.84 x 10" 4 . The r.m.s. of the fit residuals is 0.0517. 
The inset shows the impact of different factors contributing to the overall fit of 
Model 4. The LPPL factor is reduced or translated by the amount of A for a better 
comparison. 
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Fig. 7. (Top panel) AIC of the five one-factor LPPL models with comparison to the 
first- and second-order Landau LPPL models. (Bottom panel) Relative AIC against 
the AICq of the first-order LPPL model. 
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Fig. 8. Wilks' log-likelihood-ratio test of the significance of the new factors intro- 
duced in (33). The second-order Landau formula is also tested. Note that all values 
of Pr which are less than 0.001 are plotted to be 0.001 in the figure for better 
representation. 
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Fig. 9. The evolution of the fitted values of At and A w . The dramatic drop of the 
value of At endorses the crossover from the first-order regime to the second-order. 
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Fig. 10. Modeling and prediction of the US S&P 500 index from 2000/08/21 to 
2005/03/21 using the second-order Landau formula (32). The fitted parameters are 
t c = 2000/08/17, m = 1.32, u = 13.72, V = 1.31, A t = 545, A w = -10.28, A = 7.26, 
B = -9.505 x 10~ 5 , and C = -4.224 x 10~ 5 with the r.m.s. of fit residuals being 
0.0403. 
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